This paper presents a new adaptive control approach using Chebyshev orthogonal polynomials as basis
functions in a least-squares functional approximation. The use of orthogonal basis functions improves the 
function approximation significantly and enables better convergence of parameter estimates. Flight control 
simulations demonstrate the effectiveness of the proposed adaptive control approach. 

I. Introduction 

In many physical applications, the structure of the relationship between the input and output of a process
is not known with certainty. Such uncertainty is called unstructured. In systems with unstructured uncertainty,
the transfer function between the input and output is usually not known. Let $y(t) \in \mathbb{R}^n$ be the
output with an unknown transfer function, expressed as

$$y = f(x)$$ (1)

where $x(t) \in \mathcal{D} \subset \mathbb{R}^p$ and $f(x) \in \mathbb{R}^n$ is an unknown function that is assumed to be bounded in $x$.

When the structure of the uncertainty is unknown, function approximation is usually employed to estimate the
unknown function. In recent years, neural networks have gained a lot of attention in function approximation theory
in connection with adaptive control. Multi-layer neural networks have the capability of approximating an unknown
function to within an arbitrary value of the approximation error. The universal approximation theorem for sigmoidal
neural networks by Cybenko 1 and Micchelli's theorem 2 for radial basis functions provide a theoretical justification
of function approximation using neural networks. However, the use of multi-layer neural networks adds complexity
through the gradient-based back-propagation training rules.

Polynomial approximation is a well-known regression technique for function approximation. In theory, as the
degree of an approximating polynomial increases, the approximation error is expected to decrease. However, increasing
the degree of the approximating polynomial beyond a theoretical limit could lead to oscillations in the approximating 
polynomial due to over-parametrization. Regularization techniques to constrain parameters have been developed to 
prevent over-parametrization. 1 

In this paper, we explore the use of a special class of polynomials, known as Chebyshev orthogonal polynomials,
as basis functions for function approximation. The Chebyshev polynomials have been shown to provide the “best”
approximation of a function compared with other types of polynomials. 4 The use of the Chebyshev polynomials in the
context of adaptive control with unstructured uncertainty is demonstrated in this paper. Simulation results show
that Chebyshev polynomials are significantly more effective in the adaptive control setting than a regular
polynomial regression.
II. Polynomial Approximation

Any sufficiently smooth function $f(x)$ can be expanded as a Taylor series about some $x = x_0$

$$f(x) = f(x_0) + \nabla f_x(x_0)(x - x_0) + \frac{1}{2}(x - x_0)^\top \nabla^2 f_x(x_0)(x - x_0) + \cdots$$ (2)

Then $f(x)$ can be represented as

$$f(x) = \Theta^{*\top}\Phi(x) - \varepsilon(x)$$ (3)


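where $\Theta^* \in \mathbb{R}^m \times \mathbb{R}^n$ is a matrix of constant but unknown coefficients, $\Phi(x) \in \mathbb{R}^m$ is a vector of regressors in terms of
monomials of $x$

$$\Phi(x) = \begin{bmatrix} 1 & x_1 & x_2 & \cdots & x_p & x_1^2 & x_1 x_2 & \cdots & x_1 x_2^{q-1} & \cdots & x_p^q \end{bmatrix}^\top$$ (4)

and $\varepsilon(x)$ is a function approximation error which depends on $x$.

$f(x)$ is then approximated by a polynomial of $q$-th degree

$$\hat{f}(x) = p_q(x) = \Theta^\top\Phi(x)$$ (5)

where $\Theta \in \mathbb{R}^m \times \mathbb{R}^n$ is the estimate of $\Theta^*$.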

The coefficients $\Theta$ can be computed using various least-squares methods such as the batch least-squares method,
the least-squares gradient method, or the recursive least-squares (RLS) method. Note that since $\Theta^\top\Phi(x)$ is an
approximation of an unknown function $f(x)$, the approximation error will not converge to zero asymptotically
regardless of whether or not $\Phi(x)$ is persistently exciting.

Weierstrass's theorem 4 states that given any sufficiently smooth function $f(x) \in \mathcal{C}[a,b]$ and $\varepsilon_0 > 0$, there exists
a polynomial $p_q(x)$ for some sufficiently large $q$ such that $\|f(x) - p_q(x)\| < \varepsilon_0$. This means that any sufficiently
smooth function can be approximated by a polynomial of $q$-th degree. Then the function approximation error can
be made sufficiently small on a compact domain of $x$ such that $\sup_{x \in \mathcal{D}} \|\varepsilon(x)\| \le \varepsilon_0$ for all $x \in \mathcal{D} \subset \mathbb{R}^p$.

There are several types of polynomial approximation of a function. The regular polynomial regression using 
monomials as basis functions is frequently used for function approximation. Orthogonality is a property of a function 
that belongs to a metric space endowed with an inner product. Given two functions $g(x)$ and $h(x)$, then $g(x)$ and $h(x)$
are orthogonal to each other if their inner product is zero. That is

$$\langle g(x), h(x) \rangle = 0$$ (6)

where $\langle \cdot, \cdot \rangle$ is an inner product operator that takes two real-valued functions and returns a constant. The inner
product has a concrete mathematical definition depending on the class of functions.

A regular polynomial does not possess this orthogonality property. In contrast, certain classes of polynomials
such as Chebyshev polynomials and Legendre polynomials are orthogonal polynomials. One advantage of an
orthogonal polynomial over a regular polynomial is that a low-degree orthogonal polynomial can provide a good function
approximation with minimal loss of accuracy as compared to a higher-degree regular polynomial.

Chebyshev polynomials are special polynomial functions that are associated with the solution of a class of
Sturm-Liouville differential equations

$$(1 - x^2)\frac{d^2 y}{dx^2} - x\frac{dy}{dx} + n^2 y = 0$$ (7)

for $x \in [-1, 1]$.

This differential equation is known as the Chebyshev differential equation of the first kind. The Chebyshev
differential equation of the second kind is of the form

$$(1 - x^2)\frac{d^2 y}{dx^2} - 3x\frac{dy}{dx} + n(n + 2) y = 0$$ (8)

The Chebyshev polynomials of the first kind are generated by the recurrence relation

$$T_{n+1}(x) = 2xT_n(x) - T_{n-1}(x)$$ (9)

where $T_0(x) = 1$ and $T_1(x) = x$.

The first several terms of the Chebyshev polynomials are given as follows:

$$\begin{aligned} T_0(x) &= 1 \\ T_1(x) &= x \\ T_2(x) &= 2x^2 - 1 \\ T_3(x) &= 4x^3 - 3x \\ T_4(x) &= 8x^4 - 8x^2 + 1 \end{aligned}$$ (10)
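The recurrence in Eq. (9) can be sketched numerically as follows. This is a minimal illustration, not part of the original study; the function name `chebyshev_basis` and the sample grid are assumptions made for the example, and NumPy's `chebval` is used only as an independent cross-check.

```python
import numpy as np
from numpy.polynomial import chebyshev as C


def chebyshev_basis(x, n):
    """Rows T_0(x), ..., T_n(x), built with T_{k+1} = 2 x T_k - T_{k-1}."""
    x = np.asarray(x, dtype=float)
    T = np.empty((n + 1,) + x.shape)
    T[0] = 1.0
    if n >= 1:
        T[1] = x
    for k in range(1, n):
        T[k + 1] = 2.0 * x * T[k] - T[k - 1]
    return T


x = np.linspace(-1.0, 1.0, 5)          # hypothetical sample points in [-1, 1]
T = chebyshev_basis(x, 4)
# Independent evaluation of T_4 with NumPy's Chebyshev series evaluator.
T4_reference = C.chebval(x, [0, 0, 0, 0, 1])
```

The rows of `T` can then be stacked into the regressor vector used in the function approximation that follows.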


The solutions of a Sturm-Liouville differential equation constitute an orthogonal basis that spans a Hilbert space,
that is, a complete inner product space, with a weighted inner product definition

$$\langle g(x), h(x) \rangle = \int_a^b w(x) g(x) h(x)\,dx$$ (11)

The Chebyshev polynomials are orthogonal with respect to a weighting function

$$w(x) = \frac{1}{\sqrt{1 - x^2}}$$ (12)

such that

$$\langle T_n(x), T_m(x) \rangle = \int_{-1}^{1} \frac{T_n(x) T_m(x)}{\sqrt{1 - x^2}}\,dx = \begin{cases} 0 & n \neq m \\ \pi & n = m = 0 \\ \dfrac{\pi}{2} & n = m \neq 0 \end{cases}$$ (13)
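The orthogonality relation in Eq. (13) can be checked numerically. The sketch below uses Gauss-Chebyshev quadrature, which builds the weight $1/\sqrt{1 - x^2}$ into its nodes and weights; the helper name `weighted_inner_product` and the number of quadrature nodes are assumptions made for illustration.

```python
import numpy as np
from numpy.polynomial import chebyshev as C


def weighted_inner_product(n, m, num_nodes=32):
    """Approximate <T_n, T_m> with weight 1/sqrt(1 - x^2) on [-1, 1]."""
    nodes, weights = C.chebgauss(num_nodes)      # Gauss-Chebyshev rule
    Tn = C.chebval(nodes, [0] * n + [1])         # T_n at the nodes
    Tm = C.chebval(nodes, [0] * m + [1])         # T_m at the nodes
    return float(np.sum(weights * Tn * Tm))


off_diagonal = weighted_inner_product(2, 3)      # n != m
norm_T0 = weighted_inner_product(0, 0)           # n = m = 0
norm_T3 = weighted_inner_product(3, 3)           # n = m != 0
```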


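Any subspace $\mathcal{S}$ in an inner product space $\mathcal{H}$ has an orthogonal complement $\mathcal{S}^\perp$ such that their bases completely
span the inner product space $\mathcal{H}$. Therefore

$$\mathcal{H} = \mathcal{S} \oplus \mathcal{S}^\perp$$ (14)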


Since the Chebyshev polynomials are orthogonal, they form a complete basis for a real-valued function. This
implies that any unknown, non-singular function $f(x)$ can be approximated by a Chebyshev polynomial of degree $n$

$$\hat{f}(x) = \theta_0 T_0(x) + \theta_1 T_1(x) + \cdots + \theta_n T_n(x) = \sum_{i=0}^{n} \theta_i T_i(x) = \Theta^\top\Phi(x)$$ (15)

where the polynomial coefficients are approximated by

$$\theta_i = \frac{\displaystyle\int_{-1}^{1} \frac{f(x) T_i(x)}{\sqrt{1 - x^2}}\,dx}{\displaystyle\int_{-1}^{1} \frac{T_i^2(x)}{\sqrt{1 - x^2}}\,dx}$$ (16)

for $x \in [-1, 1]$.
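As an illustration of Eqs. (15) and (16), the coefficients $\theta_i$ can be approximated by quadrature for a known test function. This is only a sketch under stated assumptions: the test function $e^x$, the degree, and the helper name `chebyshev_coefficients` are chosen for the example and do not come from the paper.

```python
import numpy as np
from numpy.polynomial import chebyshev as C


def chebyshev_coefficients(f, n, num_nodes=64):
    """Approximate theta_0..theta_n of Eq. (16) by Gauss-Chebyshev quadrature."""
    nodes, weights = C.chebgauss(num_nodes)
    fx = f(nodes)
    theta = np.empty(n + 1)
    for i in range(n + 1):
        Ti = C.chebval(nodes, [0] * i + [1])
        # Ratio of the two weighted integrals in Eq. (16).
        theta[i] = np.sum(weights * fx * Ti) / np.sum(weights * Ti * Ti)
    return theta


theta = chebyshev_coefficients(np.exp, 4)        # hypothetical test function
x = np.linspace(-1.0, 1.0, 201)
max_error = np.max(np.abs(np.exp(x) - C.chebval(x, theta)))
```

For a smooth function such as this one, a low-degree expansion already gives a small error on $[-1, 1]$, which is the property the adaptive control design relies on.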


III. Least-Squares Estimation 

The input-output transfer function of a system is given by a set of measurement data of $y(t) \in \mathbb{R}^n$ as a function of
some independent variable $x(t) \in \mathbb{R}^p$, expressed as data pairs $(x_i, y_i)$, $i = 1, 2, \ldots, N$. Furthermore, suppose the transfer
function between $x(t)$ and $y(t)$ can be linearly parametrized as

$$y = \Theta^{*\top}\Phi(x)$$ (17)

where $\Theta^* \in \mathbb{R}^m \times \mathbb{R}^n$ is a matrix of constant but unknown coefficients and $\Phi(x) \in \mathbb{R}^m$ is a bounded regressor (or basis
function) vector and is assumed to be known.

Let $\hat{y}(t)$ be an estimate of $y(t)$ such that

$$\hat{y} = \Theta^\top\Phi(x)$$ (18)

where $\Theta$ is an estimate of $\Theta^*$.

Formulating an approximation error $\varepsilon(t)$ as

$$\varepsilon = \hat{y} - y = \Theta^\top\Phi(x) - y$$ (19)
Consider the following cost function

$$J(\Theta) = \frac{1}{2}\sum_{i=1}^{N} \varepsilon_i^\top \varepsilon_i$$ (20)

When $J(\Theta)$ is minimized, the approximation error is also minimized. Then $\hat{y}$ approximates $y$ in a least-squares
sense. Thus, the parameter identification problem is posed as a functional minimization.

The necessary condition is given by

$$\frac{\partial J}{\partial \Theta^\top} = \nabla J_\Theta(\Theta) = \sum_{i=1}^{N} \frac{\partial \varepsilon_i}{\partial \Theta^\top}\varepsilon_i^\top = \sum_{i=1}^{N} \Phi(x_i)\left[\Phi^\top(x_i)\Theta - y_i^\top\right] = 0$$ (21)

where $\nabla J_\Theta$ is called the gradient of $J$ with respect to $\Theta$.

Thus, $\Theta$ can be found by solving the following least-squares regression equation

$$\Theta = A^{-1}B$$ (22)

where

$$A = \sum_{i=1}^{N} \Phi(x_i)\Phi^\top(x_i)$$ (23)

$$B = \sum_{i=1}^{N} \Phi(x_i) y_i^\top$$ (24)

and $A^{-1}$ is assumed to exist.

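Example: Suppose $y \in \mathbb{R}$ is a scalar variable which can be approximated as a $p$-th degree polynomial in terms of
$x \in \mathbb{R}$ as

$$y = \theta_0 + \theta_1 x + \cdots + \theta_p x^p = \sum_{j=0}^{p} \theta_j x^j = \Theta^\top\Phi(x)$$

where $\Theta^\top = \begin{bmatrix} \theta_0 & \theta_1 & \cdots & \theta_p \end{bmatrix}$ and $\Phi(x) = \begin{bmatrix} 1 & x & \cdots & x^p \end{bmatrix}^\top$.

The least-squares regression equation is expressed as

$$A\Theta = B$$

where

$$A = \sum_{i=1}^{N} \Phi(x_i)\Phi^\top(x_i) = \sum_{i=1}^{N} \begin{bmatrix} 1 \\ x_i \\ \vdots \\ x_i^p \end{bmatrix} \begin{bmatrix} 1 & x_i & \cdots & x_i^p \end{bmatrix} = \begin{bmatrix} N & \sum_{i=1}^{N} x_i & \cdots & \sum_{i=1}^{N} x_i^p \\ \sum_{i=1}^{N} x_i & \sum_{i=1}^{N} x_i^2 & \cdots & \sum_{i=1}^{N} x_i^{p+1} \\ \vdots & \vdots & \ddots & \vdots \\ \sum_{i=1}^{N} x_i^p & \sum_{i=1}^{N} x_i^{p+1} & \cdots & \sum_{i=1}^{N} x_i^{2p} \end{bmatrix}$$

$$B = \sum_{i=1}^{N} \Phi(x_i) y_i = \sum_{i=1}^{N} \begin{bmatrix} 1 \\ x_i \\ \vdots \\ x_i^p \end{bmatrix} y_i = \begin{bmatrix} \sum_{i=1}^{N} y_i \\ \sum_{i=1}^{N} x_i y_i \\ \vdots \\ \sum_{i=1}^{N} x_i^p y_i \end{bmatrix}$$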

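This least-squares regression method is essentially a polynomial curve-fitting technique. For example, let $p = 2$,
then the quadratic curve-fitting coefficients can be found with

$$A = \begin{bmatrix} N & \sum_{i=1}^{N} x_i & \sum_{i=1}^{N} x_i^2 \\ \sum_{i=1}^{N} x_i & \sum_{i=1}^{N} x_i^2 & \sum_{i=1}^{N} x_i^3 \\ \sum_{i=1}^{N} x_i^2 & \sum_{i=1}^{N} x_i^3 & \sum_{i=1}^{N} x_i^4 \end{bmatrix}, \quad B = \begin{bmatrix} \sum_{i=1}^{N} y_i \\ \sum_{i=1}^{N} x_i y_i \\ \sum_{i=1}^{N} x_i^2 y_i \end{bmatrix}$$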

When all available data are used in the least-squares regression method, it is sometimes called a batch least-squares
method. This is usually the case when there are sufficient data over a given time interval and the estimates of the unknown
coefficients are not needed immediately at each time step.
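The batch least-squares solution $\Theta = A^{-1}B$ for the quadratic curve-fitting case can be sketched as follows. The data set is synthetic and noise-free, and the function name `batch_least_squares` is an assumption made for this illustration.

```python
import numpy as np


def batch_least_squares(x, y, p):
    """Solve A Theta = B for a p-th degree polynomial fit of scalar data."""
    Phi = np.vander(x, p + 1, increasing=True)   # row i is [1, x_i, ..., x_i^p]
    A = Phi.T @ Phi                              # sum_i Phi(x_i) Phi(x_i)^T
    B = Phi.T @ y                                # sum_i Phi(x_i) y_i
    return np.linalg.solve(A, B)


x = np.linspace(-1.0, 1.0, 21)                   # hypothetical data points
y = 1.0 - 2.0 * x + 0.5 * x**2                   # hypothetical quadratic "truth"
theta = batch_least_squares(x, y, p=2)
```

Solving the linear system directly, rather than forming $A^{-1}$ explicitly, is a common numerical choice and does not change the result.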


American Institute of Aeronautics and Astronautics 



A. Convex Optimization and Least-Squares Gradient Method 

When the estimates of the unknown coefficients are needed at each time step, $\Theta$ can be estimated recursively using
each pair of data at each time step.

Consider the following cost function

$$J(\Theta) = \frac{1}{2}\varepsilon^\top\varepsilon$$ (29)

The gradient of the cost function with respect to $\Theta$ is given by

$$\frac{\partial J}{\partial \Theta^\top} = \nabla J_\Theta(\Theta) = \left(\frac{\partial \varepsilon}{\partial \Theta^\top}\right)\varepsilon^\top = \Phi(x)\varepsilon^\top$$ (30)

To determine a least-squares estimation based on a given data pair at each time step, the concept of convex
optimization is now introduced.

A subset $\mathcal{S}$ is said to be convex if, for all $x$, $y$ in $\mathcal{S}$ and every constant $\alpha \in [0, 1]$, $\alpha x + (1 - \alpha)y$ is also in
$\mathcal{S}$. A function $f$ is said to be convex in a convex set $\mathcal{S}$ if for every $x$, $y$ in $\mathcal{S}$ and every $\alpha \in [0, 1]$

$$f(\alpha x + (1 - \alpha)y) \le \alpha f(x) + (1 - \alpha) f(y)$$ (31)


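Note that $J(\Theta)$ is convex since

$$\frac{1}{2}\left[\alpha\varepsilon + (1 - \alpha)\varepsilon_1\right]^\top\left[\alpha\varepsilon + (1 - \alpha)\varepsilon_1\right] = \frac{1}{2}\alpha^2\varepsilon^\top\varepsilon + \alpha(1 - \alpha)\varepsilon^\top\varepsilon_1 + \frac{1}{2}(1 - \alpha)^2\varepsilon_1^\top\varepsilon_1$$

$$= \frac{1}{2}\alpha^2\left(\varepsilon^\top\varepsilon - 2\varepsilon^\top\varepsilon_1\right) + \alpha\varepsilon^\top\varepsilon_1 + \frac{1}{2}(1 - \alpha)^2\varepsilon_1^\top\varepsilon_1$$ (32)

but $\alpha^2 \le \alpha$ and $(1 - \alpha)^2 \le 1 - \alpha$ for all $\alpha \in [0, 1]$, so

$$\frac{1}{2}\alpha\left(\varepsilon^\top\varepsilon - 2\varepsilon^\top\varepsilon_1\right) + \alpha\varepsilon^\top\varepsilon_1 + \frac{1}{2}(1 - \alpha)\varepsilon_1^\top\varepsilon_1 \le \alpha\frac{1}{2}\varepsilon^\top\varepsilon + (1 - \alpha)\frac{1}{2}\varepsilon_1^\top\varepsilon_1$$ (33)

If $f \in \mathcal{C}^1$, i.e., $f$ is differentiable at least once, then $f$ is convex if

$$f(y) \ge f(x) + \left(\nabla f(x)\right)^\top(y - x)$$ (34)

If $f \in \mathcal{C}^2$, then $f$ is convex if $\nabla^2 f \ge 0$ where $\nabla^2 f$ is called the Hessian of $f$.

Now consider the minimization of $J(\Theta)$. $\Theta^*$ is said to be a global minimum of $J$ if

$$J(\Theta^*) \le J(\Theta)$$ (35)

This implies that $\nabla J_\Theta(\Theta^*) = 0$ and $\nabla^2 J_\Theta(\Theta^*) \ge 0$ since $J(\Theta)$ is twice-differentiable with respect to $\Theta$.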

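Utilizing a Taylor series expansion, one writes

$$\nabla J_\Theta(\Theta^*) = \nabla J_\Theta(\Theta^* + \Delta\Theta) + \nabla^2 J_\Theta(\Theta^* + \Delta\Theta)\Delta\Theta + O\left(\Delta\Theta^\top\Delta\Theta\right)$$ (36)

Since $\nabla J_\Theta(\Theta^*) = 0$, $\nabla J_\Theta(\Theta^* + \Delta\Theta) = \nabla J_\Theta(\Theta)$, and $\nabla^2 J_\Theta(\Theta^* + \Delta\Theta) = \nabla^2 J_\Theta(\Theta)$, then

$$\Delta\Theta = -\left[\nabla^2 J_\Theta(\Theta)\right]^{-1}\nabla J_\Theta(\Theta)$$ (37)

Equation (37) can be written in discrete-time form as

$$\Theta_{i+1} = \Theta_i - \left[\nabla^2 J_\Theta(\Theta_i)\right]^{-1}\nabla J_\Theta(\Theta_i)$$ (38)

This is known as a second-order gradient or Newton's method for convex optimization. It is noted that computing the
inverse of the Hessian matrix is generally numerically intensive. So a first-order approximation can be made by recognizing
that $\left[\nabla^2 J_\Theta(\Theta)\right]^{-1} \approx \left[\nabla^2 J_\Theta(\Theta^*)\right]^{-1} = \varepsilon > 0$, where $\varepsilon$ is a small positive parameter, when $\Theta$ is in the neighborhood of $\Theta^*$. This
approximation leads to the well-known steepest descent or first-order gradient method for convex optimization

$$\Theta_{i+1} = \Theta_i - \varepsilon\nabla J_\Theta(\Theta_i)$$ (39)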
Now dividing both sides by $\Delta t$ and taking the limit as $\Delta t \to 0$ yield

$$\dot{\Theta} = -\Gamma\nabla J_\Theta(\Theta)$$ (40)

where $\Gamma = \Gamma^\top > 0 \in \mathbb{R}^m \times \mathbb{R}^m$ is a positive definite adaptation rate matrix that effectively replaces $\varepsilon / \Delta t$. This is the
continuous-time version of the gradient method.
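The discrete-time iterations in Eqs. (38) and (39) can be compared on a small batch least-squares cost, for which the gradient is $A\Theta - B$ and the Hessian is $A$. The following is a sketch only: the synthetic data, the step size chosen as the reciprocal of the largest Hessian eigenvalue, and the iteration count are all assumptions made for illustration.

```python
import numpy as np

x = np.linspace(-1.0, 1.0, 21)                   # hypothetical data points
y = 1.0 - 2.0 * x + 0.5 * x**2                   # hypothetical quadratic "truth"
Phi = np.vander(x, 3, increasing=True)
A = Phi.T @ Phi                                  # Hessian of the batch cost
B = Phi.T @ y


def grad(theta):
    """Gradient of J = 0.5 * sum_i (Phi_i^T theta - y_i)^2."""
    return A @ theta - B


# Newton's method, Eq. (38): one step suffices for a quadratic cost.
theta0 = np.zeros(3)
theta_newton = theta0 - np.linalg.solve(A, grad(theta0))

# Steepest descent, Eq. (39), with an assumed step size 1 / lambda_max(A).
eps = 1.0 / np.linalg.eigvalsh(A)[-1]
theta_sd = np.zeros(3)
for _ in range(500):
    theta_sd = theta_sd - eps * grad(theta_sd)
```

The first-order method avoids the Hessian solve at every step at the cost of many more iterations, which is the trade-off described above.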

Returning to the minimization of $J(\Theta)$ to estimate $\Theta^*$, the differential form of the least-squares estimation of $\Theta$
can be expressed using the gradient method as

$$\dot{\Theta} = -\Gamma\nabla J_\Theta(\Theta) = -\Gamma\Phi(x)\varepsilon^\top$$ (41)


Notice the resemblance of this least-squares gradient method to a model-reference adaptive law, where the
approximation error $\varepsilon(t)$ replaces the tracking error $e(t)$.

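Example: For the polynomial curve-fitting example above, the least-squares gradient method is

$$\dot{\Theta} = \begin{bmatrix} \dot{\theta}_0 \\ \dot{\theta}_1 \\ \vdots \\ \dot{\theta}_p \end{bmatrix} = -\Gamma\left( \begin{bmatrix} 1 & x & \cdots & x^p \\ x & x^2 & \cdots & x^{p+1} \\ \vdots & \vdots & \ddots & \vdots \\ x^p & x^{p+1} & \cdots & x^{2p} \end{bmatrix} \begin{bmatrix} \theta_0 \\ \theta_1 \\ \vdots \\ \theta_p \end{bmatrix} - \begin{bmatrix} y \\ xy \\ \vdots \\ x^p y \end{bmatrix} \right)$$ (42)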
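The least-squares gradient law of Eq. (41) can be simulated for a scalar quadratic example with a forward-Euler discretization. This is a minimal sketch under stated assumptions: the "unknown" coefficients, the input signal $x(t) = \sin t$, the adaptation rate $\Gamma = I$, and the step size are hypothetical choices and are not taken from the paper.

```python
import numpy as np


def regressor(x):
    """Monomial regressor Phi(x) = [1, x, x^2]^T."""
    return np.array([1.0, x, x**2])


theta_true = np.array([1.0, -2.0, 0.5])   # hypothetical "unknown" coefficients
Gamma = np.eye(3)                          # assumed adaptation rate matrix
dt, t_final = 0.01, 200.0                  # assumed Euler step and horizon

theta = np.zeros(3)
error_norms = [np.linalg.norm(theta - theta_true)]
for k in range(int(t_final / dt)):
    x = np.sin(k * dt)                     # assumed independent input signal
    phi = regressor(x)
    y = theta_true @ phi                   # measured output
    eps = theta @ phi - y                  # approximation error
    theta = theta - dt * (Gamma @ phi) * eps   # Euler step of Eq. (41)
    error_norms.append(np.linalg.norm(theta - theta_true))
error_norms = np.array(error_norms)
```

With this step size the norm of the parameter error does not increase from one step to the next, which mirrors the Lyapunov argument that the estimation error is bounded.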


B. Persistent Excitation and Parameter Convergence 

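Let $\tilde{\Theta}(t) = \Theta(t) - \Theta^*$ be the estimation error, then

$$\varepsilon = \Theta^\top\Phi(x) - y = \tilde{\Theta}^\top\Phi(x)$$ (43)

The least-squares gradient method can be written as

$$\dot{\tilde{\Theta}} = \dot{\Theta} = -\Gamma\Phi(x)\Phi^\top(x)\tilde{\Theta}$$ (44)

Now, choose a Lyapunov candidate function

$$V(\tilde{\Theta}) = \operatorname{trace}\left(\tilde{\Theta}^\top\Gamma^{-1}\tilde{\Theta}\right)$$ (45)

Then

$$\dot{V}(\tilde{\Theta}) = 2\operatorname{trace}\left(\tilde{\Theta}^\top\Gamma^{-1}\dot{\tilde{\Theta}}\right) = -2\operatorname{trace}\left(\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\tilde{\Theta}\right) = -2\Phi^\top(x)\tilde{\Theta}\tilde{\Theta}^\top\Phi(x) = -2\varepsilon^\top\varepsilon = -2\|\varepsilon\|^2 \le 0$$ (46)

Note that $\dot{V}(\tilde{\Theta})$ can only be negative semi-definite because $\dot{V}(\tilde{\Theta})$ can be zero when $\Phi(x) = 0$ independent of $\tilde{\Theta}$.
One can establish that $V(\tilde{\Theta})$ has a finite limit as $t \to \infty$ since

$$V(t \to \infty) = V(t_0) - 2\int_{t_0}^{\infty} \|\varepsilon\|^2\,dt < \infty$$ (47)

which implies

$$2\int_{t_0}^{\infty} \|\varepsilon\|^2\,dt = V(t_0) - V(t \to \infty) < \infty$$ (48)

Therefore, $\varepsilon(t) \in \mathcal{L}_2 \cap \mathcal{L}_\infty$. Moreover, since $\Phi(x) \in \mathcal{L}_\infty$ by the problem statement, then $\tilde{\Theta}(t) \in \mathcal{L}_\infty$, but there is
no assurance that $\tilde{\Theta}(t) \to 0$ as $t \to \infty$, which would imply parameter convergence.

One cannot conclude that $\dot{V}(\tilde{\Theta})$ is uniformly continuous since

$$\ddot{V}(\tilde{\Theta}) = -4\varepsilon^\top\dot{\varepsilon} = -4\varepsilon^\top\left[\dot{\Theta}^\top\Phi(x) + \Theta^\top\dot{\Phi}(x) - \dot{y}\right]$$ (49)

is not necessarily bounded because there is no other condition placed on $\Phi(x)$ and $y(t)$ except for $\Phi(x) \in \mathcal{L}_\infty$ and
$y(t) \in \mathcal{L}_\infty$. For $\dot{V}(\tilde{\Theta})$ to be uniformly continuous, additional conditions that $\dot{\Phi}(x) \in \mathcal{L}_\infty$ and $\dot{y}(t) \in \mathcal{L}_\infty$ are required.

Then, using Barbalat's lemma, one can conclude that $\dot{V}(\tilde{\Theta}) \to 0$ or $\varepsilon(t) \to 0$, which also implies that $\dot{\Theta}(t) \to 0$ as
$t \to \infty$. Note that from Eq. (44), $\dot{\Theta}(t) \to 0$ does not necessarily imply that $\tilde{\Theta}(t) \to 0$ since $\Phi(x)\Phi^\top(x)$ can also tend
to zero instead of $\tilde{\Theta}(t)$.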

So up to this point, one can only show that the approximation error $\varepsilon(t)$ can tend to zero if $\dot{\Phi}(x) \in \mathcal{L}_\infty$, but not
necessarily the estimation error $\tilde{\Theta}(t)$ since $\Phi(x)$ can be a zero signal at some time interval. To examine the issue of
parameter convergence, suppose for a moment that Eq. (44) is a scalar equation whose solution is

$$\tilde{\Theta}(t) = \exp\left[-\Gamma\int_{t_0}^{t} \Phi(x)\Phi^\top(x)\,d\tau\right]\tilde{\Theta}(t_0)$$ (50)

Note that $x(t)$ is an independent variable as a function of $t$. Then for $\tilde{\Theta}(t)$ to be exponentially stable, which implies
an exponential parameter convergence, the following condition is required

$$\frac{1}{T}\int_{t}^{t+T} \Phi(x)\Phi^\top(x)\,d\tau \ge \alpha I$$ (51)

for all $t \ge t_0$ and some $\alpha > 0$.

This condition is called a persistent excitation (PE) condition which essentially requires an input signal to be 
persistently exciting (PE), that is, a signal that does not go to zero after some finite time when parameter convergence 
has not been reached. Another interpretation of the persistent excitation condition is that for parameter identification 
to converge exponentially, an input signal must be sufficiently rich to excite all system modes associated with the 
parameters to be identified. It should be noted that while persistent excitation is needed for parameter convergence, in
practice, input signals that are persistently exciting can lead to unwanted consequences such as exciting unknown or
unmodeled dynamics that can degrade the stability of a dynamical system.

Another observation to be made is that if x(t) is a state variable of a closed-loop system, one cannot assume that 
the persistent excitation condition can easily be satisfied. This can be explained as follows: suppose a parameter 
identification is used for adaptation, then closed-loop stability usually implies parameter convergence to the ideal 
values of the unknown parameters. However, parameter convergence requires persistent excitation which depends on 
$x(t)$, which in turn depends on parameter convergence. This is a circular argument and, therefore, it is difficult to assert
the PE condition. However, if $x(t)$ is an independent variable, then the persistent excitation condition can be assumed
to be satisfied. Suppose that this is the case, then the estimation error is given by

$$\|\tilde{\Theta}(t)\| \le \|\tilde{\Theta}(t_0)\| e^{-\gamma\alpha t}, \quad \forall t \in [t_1, t_1 + T], \quad t_1 > t_0$$ (52)

where $\gamma = \lambda_{\min}(\Gamma)$ is the smallest eigenvalue of $\Gamma$. Thus, $\tilde{\Theta}(t)$ is exponentially stable with $\tilde{\Theta}(t) \to 0$ as $t \to \infty$. Hence,
parameter convergence is established. It follows that the approximation error is also asymptotically stable (but not
necessarily exponentially stable because of $\Phi(x)$) with $\varepsilon(t) \to 0$ as $t \to \infty$.
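The PE condition in Eq. (51) can be examined numerically by computing the smallest eigenvalue of the time-averaged matrix $\Phi\Phi^\top$ over one window. The sketch below assumes a monomial regressor $[1, x, x^2]^\top$ and compares a sinusoidal signal with a constant one; the helper name `excitation_level` and both signals are hypothetical choices for illustration.

```python
import numpy as np


def excitation_level(x_samples):
    """Smallest eigenvalue of the sample average of Phi Phi^T, Phi = [1, x, x^2]^T."""
    Phi = np.vstack([np.ones_like(x_samples), x_samples, x_samples**2])  # 3 x N
    gram = Phi @ Phi.T / x_samples.size          # approximates (1/T) * integral
    return np.linalg.eigvalsh(gram)[0]


# One period of uniformly spaced samples stands in for the window [t, t + T].
t = np.linspace(0.0, 2.0 * np.pi, 2000, endpoint=False)
alpha_rich = excitation_level(np.sin(t))                 # sinusoidal signal
alpha_constant = excitation_level(np.full_like(t, 0.5))  # constant signal
```

A strictly positive value plays the role of $\alpha$ in Eq. (51), whereas the constant signal yields a rank-one matrix whose smallest eigenvalue is zero, so the condition fails.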


C. Recursive Least-Squares 

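Consider the following cost function

$$J(\Theta) = \frac{1}{2}\int_{t_0}^{t} \varepsilon^\top\varepsilon\,d\tau$$ (53)

which is the continuous-time version of the cost function for batch least-squares.

The necessary condition is

$$\nabla J_\Theta(\Theta) = \frac{\partial J}{\partial \Theta^\top} = \int_{t_0}^{t} \Phi(x)\left[\Phi^\top(x)\Theta - y^\top\right]d\tau = 0$$ (54)

from which $\Theta$ is obtained as

$$\Theta = \left[\int_{t_0}^{t} \Phi(x)\Phi^\top(x)\,d\tau\right]^{-1}\int_{t_0}^{t} \Phi(x) y^\top\,d\tau$$ (55)

assuming the inverse of $\int_{t_0}^{t} \Phi(x)\Phi^\top(x)\,d\tau$ exists. Note that the matrix $\Phi(x)\Phi^\top(x)$ is always singular and is not
invertible. However, if the PE condition is satisfied, then $\int_{t_0}^{t} \Phi(x)\Phi^\top(x)\,d\tau$ is invertible.

Introducing a matrix $R(t) = R^\top(t) > 0 \in \mathbb{R}^m \times \mathbb{R}^m$ where

$$R = \left[\int_{t_0}^{t} \Phi(x)\Phi^\top(x)\,d\tau\right]^{-1}$$ (56)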
Then

$$R^{-1}\Theta = \int_{t_0}^{t} \Phi(x) y^\top\,d\tau$$ (57)

Upon differentiation, this yields

$$R^{-1}\dot{\Theta} + \frac{dR^{-1}}{dt}\Theta = \Phi(x) y^\top$$ (58)

From Eq. (56)

$$\frac{dR^{-1}}{dt} = \Phi(x)\Phi^\top(x)$$ (59)

Therefore

$$\dot{\Theta} = -R\Phi(x)\left[\Phi^\top(x)\Theta - y^\top\right] = -R\Phi(x)\varepsilon^\top$$ (60)

Now since $RR^{-1} = I$, then

$$\dot{R}R^{-1} + R\frac{dR^{-1}}{dt} = 0$$ (61)

Thus

$$\dot{R} = -R\Phi(x)\Phi^\top(x)R$$ (62)

Eqs. (60) and (62) together constitute the well-known recursive least-squares (RLS) parameter identification method. 5
The matrix $R$ is called the covariance matrix and the RLS formula is similar to the Kalman filter where Eq. (62) is a
differential Riccati equation for a zero-order plant model. Comparing Eq. (41) with Eq. (60), $R$ plays the role of $\Gamma$ as
a time-varying adaptation rate matrix and Eq. (62) is effectively an adaptive law for the time-varying adaptation rate
matrix.

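Let $\tilde{\Theta}(t) = \Theta(t) - \Theta^*$ be the estimation error. Since $\varepsilon = \tilde{\Theta}^\top\Phi(x)$, then

$$\dot{\tilde{\Theta}} = -R\Phi(x)\Phi^\top(x)\tilde{\Theta}$$ (63)

Choose a Lyapunov candidate function

$$V(\tilde{\Theta}) = \operatorname{trace}\left(\tilde{\Theta}^\top R^{-1}\tilde{\Theta}\right)$$ (64)

Then

$$\dot{V}(\tilde{\Theta}) = \operatorname{trace}\left(2\tilde{\Theta}^\top R^{-1}\dot{\tilde{\Theta}} + \tilde{\Theta}^\top\frac{dR^{-1}}{dt}\tilde{\Theta}\right) = \operatorname{trace}\left(-2\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\tilde{\Theta} + \tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\tilde{\Theta}\right)$$

$$= -\operatorname{trace}\left(\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\tilde{\Theta}\right) = -\varepsilon^\top\varepsilon = -\|\varepsilon\|^2 \le 0$$ (65)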


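One can establish that $V(\tilde{\Theta})$ has a finite limit as $t \to \infty$ since

$$V(t \to \infty) = V(t_0) - \int_{t_0}^{\infty} \|\varepsilon\|^2\,dt < \infty$$ (66)

Therefore, $\varepsilon(t) \in \mathcal{L}_2 \cap \mathcal{L}_\infty$. Since $\Phi(x) \in \mathcal{L}_\infty$ by the problem statement, then $\tilde{\Theta}(t) \in \mathcal{L}_\infty$, but there is no guarantee
that $\tilde{\Theta}(t) \to 0$ as $t \to \infty$, which would imply parameter convergence, unless $\Phi(x)$ is PE.

Note that $\dot{V}(\tilde{\Theta})$ is not necessarily uniformly continuous since this would require that $\ddot{V}(\tilde{\Theta})$ is bounded. Evaluating
$\ddot{V}(\tilde{\Theta})$ as

$$\ddot{V}(\tilde{\Theta}) = -2\varepsilon^\top\dot{\varepsilon} = -2\varepsilon^\top\left[\dot{\tilde{\Theta}}^\top\Phi(x) + \tilde{\Theta}^\top\dot{\Phi}(x)\right] = -2\varepsilon^\top\left[-\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)R\Phi(x) + \tilde{\Theta}^\top\dot{\Phi}(x)\right]$$

$$= -2\varepsilon^\top\left\{-\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\left[\int_{t_0}^{t} \Phi(x)\Phi^\top(x)\,d\tau\right]^{-1}\Phi(x) + \tilde{\Theta}^\top\dot{\Phi}(x)\right\}$$ (67)

Therefore, $\ddot{V}(\tilde{\Theta})$ is bounded if the following conditions are imposed:

• $\dot{\Phi}(x) \in \mathcal{L}_\infty$

• $\int_{t_0}^{t} \Phi(x)\Phi^\top(x)\,d\tau$ is invertible, which implies $\Phi(x)$ is PE

If these conditions are satisfied, then using Barbalat's lemma, it can be shown that $\varepsilon(t) \to 0$ as $t \to \infty$. In addition,
$\tilde{\Theta}(t) \to 0$ as $t \to \infty$ and parameter convergence is achieved.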

Note that there are various versions of the RLS method. One popular version is the RLS method with normalization
where the adaptive law for $R$ is modified as follows:

$$\dot{R} = -\frac{R\Phi(x)\Phi^\top(x)R}{1 + n^2}$$ (68)

where $1 + n^2 = 1 + \Phi^\top(x)R\Phi(x)$ is called a normalization factor.

The time derivative of the Lyapunov candidate function for the RLS method with normalization is

$$\dot{V}(\tilde{\Theta}) = \operatorname{trace}\left(-2\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\tilde{\Theta} + \frac{\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\tilde{\Theta}}{1 + n^2}\right) = -\operatorname{trace}\left(\tilde{\Theta}^\top\Phi(x)\Phi^\top(x)\tilde{\Theta}\,\frac{1 + 2n^2}{1 + n^2}\right)$$

$$= -\varepsilon^\top\varepsilon\,\frac{1 + 2n^2}{1 + n^2} = -\|\varepsilon\|^2\,\frac{1 + 2n^2}{1 + n^2} \le 0$$ (69)

Note that $\dot{V}(\tilde{\Theta})$ is more negative with than without normalization. Therefore, the effect of normalization is to
make the adaptive law for $R$ more stable, but the parameter convergence is slower.

Another popular version is the RLS method with forgetting factor and normalization, which is given without
derivation as

$$\dot{R} = \beta R - \frac{R\Phi(x)\Phi^\top(x)R}{1 + n^2}$$ (70)

where $0 \le \beta \le 1$ is called a forgetting factor.
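The basic continuous-time RLS laws, $\dot{\Theta} = -R\Phi(x)\varepsilon^\top$ together with Eq. (62), can be simulated with a forward-Euler discretization. The sketch below is illustrative only: the scalar quadratic example, the input $x(t) = \sin t$, the initial covariance $R(0) = 100I$, and the step size are assumptions, and neither normalization nor a forgetting factor is included.

```python
import numpy as np


def regressor(x):
    """Monomial regressor Phi(x) = [1, x, x^2]^T."""
    return np.array([1.0, x, x**2])


theta_true = np.array([1.0, -2.0, 0.5])   # hypothetical "unknown" coefficients
dt, t_final = 0.001, 20.0                  # assumed Euler step and horizon

theta = np.zeros(3)
R = 100.0 * np.eye(3)                      # assumed initial covariance matrix
for k in range(int(t_final / dt)):
    phi = regressor(np.sin(k * dt))        # assumed independent input signal
    eps = theta @ phi - theta_true @ phi   # approximation error
    theta_dot = -R @ phi * eps             # parameter update law
    R_dot = -R @ np.outer(phi, phi) @ R    # covariance update, Eq. (62)
    theta = theta + dt * theta_dot
    R = R + dt * R_dot

final_error = np.linalg.norm(theta - theta_true)
```

In this setup the covariance matrix shrinks as data accumulate, so the effective adaptation rate decreases over time, consistent with the interpretation of $R$ as a time-varying adaptation rate matrix.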


IV. Adaptive Control with Unstructured Uncertainty 

The RLS method has been used in adaptive control applications and demonstrated to be highly effective in esti- 
mating parametric uncertainty in adaptive control. In a previous study, a hybrid direct-indirect adaptive control with 
the RLS was developed. Other techniques based on the RLS have been recently developed such as the RLS-based 
modification in adaptive control. 7 In this study, the approach is considered to be a direct adaptive control method. 

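Consider a second-order SISO system

$$\ddot{y} + a\dot{y} + cy = b\left[u + f(y)\right]$$ (71)

where $a$, $b$, and $c$ are known and $f(y)$ is an unstructured uncertainty.

The state-space form of the system is

$$\dot{x} = Ax + B\left[u + f(y)\right]$$ (72)

where $x = \begin{bmatrix} y & \dot{y} \end{bmatrix}^\top$ and

$$A = \begin{bmatrix} 0 & 1 \\ -c & -a \end{bmatrix}, \quad B = \begin{bmatrix} 0 \\ b \end{bmatrix}$$ (73)

A reference model is given by

$$\dot{x}_m = A_m x_m + B_m r$$ (74)

where $x_m \in \mathbb{R}^2$, $r \in \mathbb{R}$, and

$$A_m = \begin{bmatrix} 0 & 1 \\ -\omega_n^2 & -2\zeta\omega_n \end{bmatrix}, \quad B_m = \begin{bmatrix} 0 \\ b_m \end{bmatrix}$$ (75)

with $\zeta > 0$ and $\omega_n > 0$.

The nominal system is designed to track the reference model with a nominal controller

$$\bar{u} = -K_x x + k_r r$$ (76)

where $A - BK_x = A_m$ and $Bk_r = B_m$, so that

$$K_x = \left(B^\top B\right)^{-1}B^\top(A - A_m)$$ (77)

$$k_r = \left(B^\top B\right)^{-1}B^\top B_m$$ (78)

Note that $\left(B^\top B\right)^{-1}B^\top$ is the left pseudo-inverse of $B$ and is equal to

$$\left(B^\top B\right)^{-1}B^\top = \frac{1}{b^2}B^\top$$ (79)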
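The nominal gains in Eqs. (77) and (78) can be evaluated numerically with the left pseudo-inverse of $B$. In the sketch below, all numerical values (plant constants, damping ratio, natural frequency) and the choice $b_m = \omega_n^2$ are hypothetical and serve only to illustrate the matching conditions $A - BK_x = A_m$ and $Bk_r = B_m$.

```python
import numpy as np

a, b, c = 1.0, 3.0, 2.0                    # hypothetical plant constants
zeta, omega_n = 0.7, 2.0                   # hypothetical reference-model parameters

A = np.array([[0.0, 1.0], [-c, -a]])
B = np.array([[0.0], [b]])
A_m = np.array([[0.0, 1.0], [-omega_n**2, -2.0 * zeta * omega_n]])
B_m = np.array([[0.0], [omega_n**2]])      # assumed b_m = omega_n^2 (unit DC gain)

B_left_inv = np.linalg.solve(B.T @ B, B.T)   # (B^T B)^{-1} B^T, Eq. (79)
K_x = B_left_inv @ (A - A_m)                 # Eq. (77)
k_r = B_left_inv @ B_m                       # Eq. (78)
```

The matching conditions hold here because the first rows of $A - A_m$ and $B_m$ are zero, so both lie in the range of $B$.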

To accommodate the uncertainty, an adaptive controller is included in the controller

$$u = \bar{u} + u_{ad}$$ (80)

where the adaptive controller

$$u_{ad} = -\Theta^\top\Phi(y)$$ (81)

is designed to estimate the unstructured uncertainty which can be expressed as

$$f(y) = \Theta^{*\top}\Phi(y) - \varepsilon(y)$$ (82)

where $\Theta^* \in \mathbb{R}^p$ is an unknown constant ideal weight matrix, $\Phi(y) \in \mathbb{R}^p$ is a vector of known bounded regressors or
basis functions, and $\varepsilon(y) \in \mathbb{R}$ is an approximation error.

In general, $\Phi(y)$ can be any bounded regressor function. However, with the choice of Chebyshev polynomials,
these regressor functions are also true basis functions with their endowed orthogonality properties. Basis functions
provide a better approximation of an unstructured uncertainty than non-basis regressor functions.

Alternatively, an unstructured uncertainty can also be approximated by a neural network

$$f(y) = \Theta^{*\top}\Phi\left(W^{*\top}\bar{y}\right)$$ (83)

where $\Theta^* \in \mathbb{R}^{p+1}$ and $W^* \in \mathbb{R}^2$ are unknown constant ideal weight matrices, $\bar{y} = \begin{bmatrix} 1 & y \end{bmatrix}^\top \in \mathbb{R}^2$, and $\Phi\left(W^{*\top}\bar{y}\right) \in \mathbb{R}^{p+1}$.

Invoking Weierstrass's theorem, $\varepsilon(y)$ can be made sufficiently small in a compact domain of $y(t)$ such that
$\sup_{y \in \mathcal{D}} \|\varepsilon(y)\| \le \varepsilon_0$ for all $y \in \mathcal{D} \subset \mathbb{R}$ by a suitable selection of $\Phi(y)$.

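Define a desired plant model as

$$\dot{x}_d = A_m x + B_m r$$ (84)

Then formulating a plant modeling error, denoted by $\epsilon$ to distinguish it from the approximation error $\varepsilon$, as

$$\epsilon = \dot{x}_d - \dot{x} = Ax + B\bar{u} - \dot{x} = B\tilde{\Theta}^\top\Phi(y) + B\varepsilon$$ (85)

The RLS adaptive law for estimating $\Theta$ is given by

$$\dot{\Theta} = -R\Phi(y)\epsilon^\top B\left(B^\top B\right)^{-1}$$ (86)

$$\dot{R} = -\eta R\Phi(y)\Phi^\top(y)R$$ (87)

where $0 \le \eta \le 1$.

The estimation error equation for the RLS adaptive law is then obtained as

$$\dot{\tilde{\Theta}} = -R\Phi(y)\left[\Phi^\top(y)\tilde{\Theta}B^\top + \varepsilon^\top B^\top\right]B\left(B^\top B\right)^{-1} = -R\Phi(y)\left[\Phi^\top(y)\tilde{\Theta} + \varepsilon^\top\right]$$ (88)

The tracking error equation can be expressed in terms of the approximation error as

$$\dot{e} = \dot{x}_m - \dot{x} = \dot{x}_m - \dot{x}_d + \dot{x}_d - \dot{x} = A_m e + \epsilon = A_m e + B\tilde{\Theta}^\top\Phi(y) + B\varepsilon$$ (89)

Now, choose a Lyapunov candidate function

$$V(e, \tilde{\Theta}) = e^\top Pe + \operatorname{trace}\left(\tilde{\Theta}^\top R^{-1}\tilde{\Theta}\right)$$ (90)

where $P = P^\top > 0$ solves the Lyapunov equation $PA_m + A_m^\top P = -Q$ for some $Q = Q^\top > 0$.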
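Then $\dot{V}$ is

$$\begin{aligned} \dot{V}(e, \tilde{\Theta}) &= -e^\top Qe + 2e^\top P\epsilon + \operatorname{trace}\left(2\tilde{\Theta}^\top R^{-1}\dot{\tilde{\Theta}} + \tilde{\Theta}^\top\frac{dR^{-1}}{dt}\tilde{\Theta}\right) \\ &= -e^\top Qe + 2e^\top P\epsilon + \operatorname{trace}\left(-2\tilde{\Theta}^\top\Phi(y)\left[\Phi^\top(y)\tilde{\Theta} + \varepsilon^\top\right] + \eta\tilde{\Theta}^\top\Phi(y)\Phi^\top(y)\tilde{\Theta}\right) \\ &= -e^\top Qe + 2e^\top P\epsilon - (2 - \eta)\Phi^\top(y)\tilde{\Theta}\tilde{\Theta}^\top\Phi(y) - 2\varepsilon^\top\tilde{\Theta}^\top\Phi(y) \\ &\le -e^\top Qe + 2e^\top P\epsilon - \Phi^\top(y)\tilde{\Theta}\tilde{\Theta}^\top\Phi(y) - 2\varepsilon^\top\tilde{\Theta}^\top\Phi(y) \end{aligned}$$ (91)

Note that

$$\frac{\epsilon^\top\epsilon}{\|B\|^2} = \Phi^\top(y)\tilde{\Theta}\tilde{\Theta}^\top\Phi(y) + 2\varepsilon^\top\tilde{\Theta}^\top\Phi(y) + \varepsilon^\top\varepsilon$$ (92)

Therefore

$$\dot{V}(e, \tilde{\Theta}) \le -\lambda_{\min}(Q)\|e\|^2 + 2\|P\|\|e\|\|\epsilon\| - \frac{\|\epsilon\|^2}{\|B\|^2} + \varepsilon_0^2$$ (93)

where $\varepsilon_0 = \sup_{y \in \mathcal{D}} \|\varepsilon(y)\|$.

Define a compact set $\mathcal{S}$

$$\mathcal{S} = \left\{ (e, \epsilon) : \lambda_{\min}(Q)\|e\|^2 - 2\lambda_{\max}(P)\|e\|\|\epsilon\| + \frac{\|\epsilon\|^2}{\|B\|^2} \le \varepsilon_0^2 \right\}$$ (94)

$\dot{V}(e, \tilde{\Theta}) > 0$ inside of $\mathcal{S}$ but $\dot{V}(e, \tilde{\Theta}) \le 0$ outside $\mathcal{S}$. Therefore, $e(t) \in \mathcal{L}_\infty$ and $\epsilon(t) \in \mathcal{L}_\infty$. Define
$\epsilon_0 = \sup_{x \in \mathcal{D}} \|\epsilon(x)\|$, then $\dot{V}(e, \tilde{\Theta}) \le 0$ if

$$\|e\| \ge r = \frac{\lambda_{\max}(P)\epsilon_0 + \sqrt{\lambda_{\max}^2(P)\epsilon_0^2 + \lambda_{\min}(Q)\left(\varepsilon_0^2 - \epsilon_0^2 / \|B\|^2\right)}}{\lambda_{\min}(Q)}$$ (95)

It can be shown that $e(t)$ is uniformly ultimately bounded with an ultimate bound

$$\rho = \sqrt{\frac{\lambda_{\max}(P)}{\lambda_{\min}(P)}}\,r$$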

Example: Consider a first-order scalar system with unstructured uncertainty

$$\dot{x} = ax + b\left[u + f(x)\right]$$

where $a$ and $f(x)$ are unknown, but $b = 2$. For simulation purposes, $a = 1$ and $f(x) = 0.2\left(\sin 2x + \cos 4x + e^{x}\right)$.

The reference model is given by

$$\dot{x}_m = a_m x_m + b_m r$$

where $a_m = -1$, $b_m = 1$, and $r(t) = \sin t$.

Since $f(x)$ is unknown, a regular polynomial of $q$-th degree is used to approximate $f(x)$ as

$$f(x) = a_0 + a_1 x + \cdots + a_q x^q - \varepsilon(x) = \Theta^{*\top}\Phi(x) - \varepsilon(x)$$ (96)

where $a_i$, $i = 0, 1, \ldots, q$ are constant unknown coefficients.

The controller is designed as

$$u = k_x(t)x + k_r r - \Theta^\top(t)\Phi(x)$$

where $k_x(t)$ and $\Theta(t)$ are computed by the least-squares gradient adaptive laws as

$$\dot{k}_x = \gamma x\epsilon$$

$$\dot{\Theta} = -\Gamma\Phi(x)\epsilon$$

with $k_x(0) = 0$, $\Theta(0) = 0$, $\gamma = 1$, and $\Gamma = I$, where

$$\epsilon = \hat{a}x + b\bar{u} - \dot{x}$$

$$\hat{a} = a_m - bk_x$$

$$\bar{u} = k_x x + k_r r$$

The tracking errors for $q = 1, 2, 3, 4$ are shown in the following plots. Note that the tracking error improves for
$q > 2$. Even though the tracking error improves, the function $f(x)$ does not seem to be well approximated, as shown in Fig. 3.
This is also due to the poor convergence of $k_x$ and $\Theta$.



Fig. 1 - Tracking Error due to Least-Squares Gradient Method with Regular Polynomial Approximation

Fig. 2 - $k_x$ and $\Theta$ due to Least-Squares Gradient Method with Regular Polynomial Approximation

Fig. 3 - Function Approximation at t = 100 due to Least-Squares Gradient Method with Regular Polynomial

Now suppose the Chebyshev orthogonal polynomials are used instead. Then

$$f(x) = a_0 + a_1 T_1(x) + \cdots + a_q T_q(x) - \varepsilon(x) = \Theta^{*\top}\Phi(x) - \varepsilon(x)$$

The simulation results are as shown. For $q = 1$, the result is the same as that of the regular polynomial. However, it can
be seen that the tracking error is significantly reduced for $q = 2$ with the Chebyshev polynomial and is even smaller than
that for $q = 4$ with the regular polynomial. For $q = 4$, the Chebyshev polynomial approximation results in a very small
tracking error. The unknown function is very well approximated by a 4th-degree Chebyshev polynomial as shown in
Fig. 6.
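One plausible numerical reason for this difference, offered here as an illustration rather than as a result from the simulations, is the conditioning of the regressor matrix $\sum_i \Phi(x_i)\Phi^\top(x_i)$ that drives the least-squares estimation. The sketch below compares this matrix for monomial and Chebyshev regressors of degree $q = 4$ on an assumed uniform grid over $[-1, 1]$.

```python
import numpy as np
from numpy.polynomial import chebyshev as C

x = np.linspace(-1.0, 1.0, 201)                       # assumed sample grid
q = 4
Phi_monomial = np.vander(x, q + 1, increasing=True)   # columns 1, x, ..., x^q
Phi_chebyshev = C.chebvander(x, q)                    # columns T_0, ..., T_q

# Condition numbers of the matrices sum_i Phi(x_i) Phi(x_i)^T.
cond_monomial = np.linalg.cond(Phi_monomial.T @ Phi_monomial)
cond_chebyshev = np.linalg.cond(Phi_chebyshev.T @ Phi_chebyshev)
```

A smaller condition number means that the slowest and fastest directions of parameter adaptation are closer in rate, which is consistent with the better parameter convergence observed with the Chebyshev basis.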





Fig. 4 - Tracking Error due to Least-Squares Gradient Method with Chebyshev Polynomial Approximation

Fig. 5 - $k_x$ and $\Theta$ due to Least-Squares Gradient Method with Chebyshev Polynomial Approximation

Fig. 6 - Function Approximation at t = 100 due to Least-Squares Gradient Method with Chebyshev Polynomial

For comparison, consider the standard model-reference adaptive control (MRAC) in place of the least-squares
adaptive control. The MRAC update laws are given by

$$\dot{k}_x = \gamma xeb$$

$$\dot{\Theta} = -\Gamma\Phi(x)eb$$

where $e = x_m - x$.

Figure 8 illustrates parameter convergence of MRAC. As noted, the tracking error is not as good with the MRAC
as with the least-squares gradient method. The parameters $k_x(t)$ and $\Theta(t)$ are more oscillatory. The function
approximation by the MRAC adaptive laws is poorer than that by the least-squares gradient method. Furthermore, for systems
with unstructured uncertainty, MRAC is known to be non-robust since the parameter estimation error is not necessarily
bounded. Therefore, a robust modification or a projection method must be used to ensure that the parameter estimation
error is bounded.

Fig. 7 - Tracking Error due to MRAC with Chebyshev Polynomial Approximation

Fig. 8 - $k_x$ and $\Theta$ due to MRAC with Chebyshev Polynomial Approximation

Fig. 9 - Function Approximation at t = 100 due to MRAC with Chebyshev Polynomial
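V. Flight Control Application

Consider the short-period dynamics of an aircraft with unstructured uncertainty $f(\alpha)$ as a function of the angle of
attack due to nonlinear aerodynamics

$$\begin{bmatrix} \dot{\alpha} \\ \dot{\theta} \\ \dot{q} \end{bmatrix} = \begin{bmatrix} \dfrac{Z_\alpha}{\bar{u}} & 0 & 1 \\ 0 & 0 & 1 \\ M_\alpha + \dfrac{M_{\dot{\alpha}}Z_\alpha}{\bar{u}} & 0 & M_q + M_{\dot{\alpha}} \end{bmatrix} \begin{bmatrix} \alpha \\ \theta \\ q \end{bmatrix} + \begin{bmatrix} \dfrac{Z_{\delta_e}}{\bar{u}} \\ 0 \\ M_{\delta_e} + \dfrac{M_{\dot{\alpha}}Z_{\delta_e}}{\bar{u}} \end{bmatrix} \left[\delta_e + f(\alpha)\right]$$ (97)

where $Z_\alpha$, $Z_{\delta_e}$, $M_\alpha$, $M_{\dot{\alpha}}$, $M_q$, and $M_{\delta_e}$ are stability and control derivatives; and $\bar{u}$ is the trim airspeed.

A pitch attitude controller is designed to track a desired second-order pitch attitude dynamics according to

$$\ddot{\theta}_m + 2\zeta\omega_n\dot{\theta}_m + \omega_n^2\theta_m = \omega_n^2\theta_c$$ (98)

where $\omega_n = 1.5$ rad/sec and $\zeta = 0.85$ are the desired natural frequency and damping ratio of the pitch attitude response,
and $\theta_c$ is the pitch attitude command.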

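The pitch rate equation is written as

$$\ddot{\theta} - \left(M_\alpha + \frac{M_{\dot{\alpha}}Z_\alpha}{\bar{u}}\right)\alpha - \left(M_q + M_{\dot{\alpha}}\right)\dot{\theta} = \left(M_{\delta_e} + \frac{M_{\dot{\alpha}}Z_{\delta_e}}{\bar{u}}\right)\left[\delta_e + f(\alpha)\right]$$ (99)

The elevator input is designed with the following proportional-derivative (PD) control law

$$\delta_e = -k_\alpha\alpha - k_\theta(\theta - \theta_c) - k_q q - \Theta^\top\Phi(\alpha) = -K_x x + k_\theta\theta_c - \Theta^\top\Phi(\alpha)$$ (100)

where $x = \begin{bmatrix} \alpha & \theta & q \end{bmatrix}^\top$, $K_x = \begin{bmatrix} k_\alpha & k_\theta & k_q \end{bmatrix}$, and

$$k_\alpha = \frac{M_\alpha + \dfrac{M_{\dot{\alpha}}Z_\alpha}{\bar{u}}}{M_{\delta_e} + \dfrac{M_{\dot{\alpha}}Z_{\delta_e}}{\bar{u}}}, \quad k_\theta = \frac{\omega_n^2}{M_{\delta_e} + \dfrac{M_{\dot{\alpha}}Z_{\delta_e}}{\bar{u}}}, \quad k_q = \frac{2\zeta\omega_n + M_q + M_{\dot{\alpha}}}{M_{\delta_e} + \dfrac{M_{\dot{\alpha}}Z_{\delta_e}}{\bar{u}}}$$ (101)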


The numerical model of the short-period dynamics is given by

(102)

(103)


For simulation purposes, the unstructured uncertainty that represents nonlinear aerodynamics is described by

$$f(\alpha) = 0.1\cos\alpha^3 - 0.2\sin 10\alpha - 0.05e^{\ldots}$$


The feedback gain is computed to be $K_x = \begin{bmatrix} 0.7616 & -0.6365 & -0.5142 \end{bmatrix}$. The nominal closed-loop plant is
then chosen to be the reference model as
The plant modeling error is computed as $\epsilon = \dot{x}_d - \dot{x} = A_m x + B_m r - \dot{x}$, assuming that the angle of attack rate and
pitch acceleration are measurable from the vertical acceleration sensors. The uncertainty is modeled with the first four
terms of the Chebyshev basis polynomials

$$\Theta^\top\Phi(\alpha) = \theta_1 + \theta_2\alpha + \theta_3\left(2\alpha^2 - 1\right) + \theta_4\left(4\alpha^3 - 3\alpha\right)$$

The RLS parameter estimation is computed with $\eta = 0$, which effectively is a least-squares gradient method,
and with $\eta = 0.2$. The covariance matrix is chosen to be $R = 20I$. The aircraft longitudinal responses for $\eta = 0$ and $\eta = 0.2$
are as shown in Figs. 10 and 11.

Fig. 10 - Aircraft Response with Least-Squares Gradient Adaptive Control ($\eta = 0$)

Fig. 11 - Aircraft Response with RLS Adaptive Control ($\eta = 0.2$)

It can be seen that the least-squares gradient method (RLS with $\eta = 0$) results in very good tracking performance,
but the RLS method with $\eta = 0.2$ exhibits poor performance. This is expected, as the rate of parameter convergence
for the RLS is proportional to $-(2 - \eta)\|\varepsilon\|^2$ according to the Lyapunov analysis. However, the slower parameter
convergence of the RLS can improve the stability robustness of adaptive control in the presence of time delay or unmodeled
dynamics, as will be shown later.
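The role of $\eta$ can be illustrated on a simplified scalar estimation problem. The sketch below is not the aircraft simulation. It assumes update laws of the form $\dot{\Theta} = -R\Phi\varepsilon$ and $\dot{R} = -\eta R\Phi\Phi^\top R$ with $\varepsilon = \Theta^\top\Phi(\alpha) - f(\alpha)$, for which $V = \tilde{\Theta}^\top R^{-1}\tilde{\Theta}$ satisfies $\dot{V} = -(2 - \eta)\varepsilon^2$ when $f$ lies in the span of the basis. The excitation $\alpha(t) = \sin t$, the Euler integration, and the names `phi` and `estimate` are also assumptions.

```python
import numpy as np


def phi(alpha):
    """First four Chebyshev polynomials T_0..T_3 at a scalar alpha."""
    return np.array([1.0, alpha, 2.0 * alpha**2 - 1.0,
                     4.0 * alpha**3 - 3.0 * alpha])


def estimate(target, eta, r0=20.0, t_end=20.0, dt=0.002):
    """Euler-integrate the assumed laws
        theta_dot = -R Phi eps,   R_dot = -eta R Phi Phi^T R,
    with eps = theta^T Phi - target(alpha) and alpha(t) = sin(t)."""
    theta = np.zeros(4)
    R = r0 * np.eye(4)
    for k in range(int(round(t_end / dt))):
        alpha = np.sin(k * dt)                # assumed excitation signal
        p = phi(alpha)
        eps = theta @ p - target(alpha)       # estimation error
        Rp = R @ p
        theta = theta - dt * eps * Rp
        R = R - dt * eta * np.outer(Rp, Rp)   # eta = 0 keeps R constant
    return theta, R


if __name__ == "__main__":
    theta_true = np.array([0.5, -0.3, 0.2, 0.1])   # hypothetical parameters
    target = lambda a: theta_true @ phi(a)
    for eta in (0.0, 0.2):
        theta, R = estimate(target, eta)
        print("eta =", eta,
              "parameter error =", np.linalg.norm(theta - theta_true),
              "trace(R) =", np.trace(R))
```

With $\eta = 0$ the matrix $R$ stays at its initial value and the update reduces to a gradient law with a constant gain. With $\eta > 0$ the trace of $R$ can only decrease, so the effective adaptation gain shrinks over time.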

For comparison, the parameter estimation is computed by the standard MRAC method using the same Chebyshev 
basis polynomials according to 

$$\dot{\Theta} = -\Gamma\Phi(\alpha)e^\top PB \tag{104}$$

where $e = x_m - x$.
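The MRAC update in Eq. (104) can be sketched as follows. The matrix $P$ is assumed here to solve the Lyapunov equation $A_m^\top P + PA_m = -Q$, the usual choice in MRAC, which this section does not restate. The numerical values of `A_m`, `B`, `Q`, and the tracking error `e` are hypothetical placeholders, not the aircraft model, and the function names are illustrative.

```python
import numpy as np


def solve_lyapunov(A_m, Q):
    """Solve A_m^T P + P A_m = -Q for P via a Kronecker-product system."""
    n = A_m.shape[0]
    I = np.eye(n)
    # Operator acting on the row-major vectorization of P
    K = np.kron(A_m.T, I) + np.kron(I, A_m.T)
    P = np.linalg.solve(K, -Q.reshape(-1)).reshape(n, n)
    return 0.5 * (P + P.T)                    # symmetrize against round-off


def mrac_theta_dot(Gamma, Phi, e, P, B):
    """Right-hand side of Eq. (104): -Gamma Phi(alpha) e^T P B."""
    return -Gamma @ np.outer(Phi, e @ P @ B)


if __name__ == "__main__":
    # Hypothetical Hurwitz reference model and input matrix
    A_m = np.array([[-1.0, 0.0, 1.0],
                    [0.0, 0.0, 1.0],
                    [-2.0, -2.25, -3.0]])
    B = np.array([[0.0], [0.0], [1.0]])
    P = solve_lyapunov(A_m, np.eye(3))
    alpha = 0.1
    Phi = np.array([1.0, alpha, 2 * alpha**2 - 1, 4 * alpha**3 - 3 * alpha])
    e = np.array([0.01, -0.02, 0.005])        # hypothetical tracking error
    print(mrac_theta_dot(10.0 * np.eye(4), Phi, e, P, B))
```

The Kronecker-product solve keeps the example dependent on NumPy only. A dedicated Lyapunov solver would be the more natural choice for larger systems.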

In addition, instead of using the Chebyshev basis polynomials, a two-layer neural network with the sigmoidal
activation function is used to approximate the unstructured uncertainty as

$$\hat{f}(\alpha) = \Theta^\top\Phi\left(W^\top\bar{\alpha}\right) \tag{105}$$

where $\bar{\alpha} = [1 \;\; \alpha]^\top$ and $\Phi(\cdot)$ is the sigmoidal function.


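The neural network adaptive control is specified by the following adaptive laws

$$\dot{\Theta} = -\Gamma_\Theta\Phi\left(W^\top\bar{\alpha}\right)e^\top PB \tag{106}$$

$$\dot{W} = -\Gamma_W\bar{\alpha}e^\top PBV^\top\Phi'\left(W^\top\bar{\alpha}\right) \tag{107}$$

where $\Theta^\top = [V_0 \;\; V^\top]$.

The aircraft responses with MRAC ($\Gamma = \Gamma_\Theta = \Gamma_W = 10I$) using the Chebyshev polynomials and the neural network
are shown in Figs. 12 and 13.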



Fig. 12 - Aircraft Response with MRAC

Fig. 13 - Aircraft Response with Neural Network MRAC

Both responses exhibit initial high-frequency oscillations, which are indicative of incipient instability, even
though the subsequent tracking performance is very good. The neural network adaptive control has much more pronounced
high-frequency oscillations, which are due to the initialization of the weights with random numbers.

To illustrate the issue of robustness and show that the RLS is actually better able to handle a time delay or unmodeled
dynamics than the least-squares gradient method or MRAC, the time delay margin is estimated numerically
for each of the four adaptive laws. The results are shown in the following table:


Adaptive Law              Numerical Evidence of Time Delay Margin

Least-Squares Gradient    60 ms
RLS with $\eta = 0.2$     260 ms
MRAC                      10 ms
Neural Network MRAC       60 ms

Table 1 - Estimated Time Delay Margins

The RLS method has the largest time delay margin of the four methods. The standard MRAC has very poor
robustness, which is a well-known fact 8. Generally, the standard MRAC has to be modified to improve robustness
using the well-known $\sigma$-modification 5 and e-modification 9 or the recently developed optimal control modification 10
and adaptive loop recovery 11.
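The paper does not describe how the numerical time delay margins are obtained. One plausible procedure is to simulate the closed loop with increasing input delay and bisect on the delay at which the response starts to grow. The sketch below applies this idea to a hypothetical scalar loop $\dot{x}(t) = -kx(t - \tau)$, not to the aircraft model, because the analytical margin $\tau = \pi/(2k)$ of that loop provides a check. The function names, the growth test, and the step size are all assumptions.

```python
import numpy as np


def response_grows(delay, k=1.0, dt=0.01, t_end=200.0):
    """Euler-simulate x'(t) = -k x(t - delay) with x = 1 for t <= 0 and
    report whether the second half of the response exceeds the first."""
    d = int(round(delay / dt))                # delay expressed in samples
    n = int(round(t_end / dt))
    x = np.ones(d + n + 1)                    # first d + 1 samples are history
    for i in range(d, d + n):
        x[i + 1] = x[i] - dt * k * x[i - d]
    mid = d + n // 2
    return np.max(np.abs(x[mid:])) > np.max(np.abs(x[d:mid]))


def delay_margin(k=1.0, lo=0.0, hi=4.0, iters=10):
    """Bisect between a stable delay lo and an unstable delay hi."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if response_grows(mid, k):
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    print("estimated margin:", delay_margin(), "analytical:", np.pi / 2.0)
```

For an adaptive law, the same bisection would wrap the full nonlinear closed-loop simulation, and the result would depend on the command, the initial conditions, and the simulation length, which is presumably why the values in Table 1 are described as numerical evidence rather than exact margins.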

The aircraft responses due to a 60-ms time delay for the least-squares gradient method, RLS with $\eta = 0.2$, and
neural network MRAC are illustrated in Figs. 14, 15, and 16. The aircraft response due to a 10-ms time delay for
the MRAC is plotted in Fig. 17. As can be seen, the least-squares gradient method maintains very good tracking
performance even with a 60-ms time delay. Both the MRAC and neural network MRAC exhibit high-frequency
oscillations. The RLS method with $\eta = 0.2$ exhibits low-frequency transients even though it is much more robust
than the other three adaptive laws. Thus, if the time delay is not too large, the least-squares gradient method seems to
perform the best among the adaptive laws.



Fig. 14 - Aircraft Response with Least-Squares Gradient Adaptive Control ($\eta = 0$) with 60-ms Time Delay

Fig. 15 - Aircraft Response with RLS Adaptive Control ($\eta = 0.2$) with 60-ms Time Delay

Fig. 16 - Aircraft Response with Neural Network MRAC ($\eta = 0.2$) with 60-ms Time Delay

Fig. 17 - Aircraft Response with MRAC ($\eta = 0.2$) with 10-ms Time Delay


VI. Conclusions 

This paper presents an adaptive control method for systems with unstructured uncertainty. The adaptive control
method uses Chebyshev orthogonal polynomials as basis functions to approximate the unstructured uncertainty. The
Chebyshev polynomials have many desirable features in function approximation and can be shown to be the “best”
polynomial function approximation. A recursive least-squares adaptive control method is developed for second-order
systems using Chebyshev polynomials as basis functions for parameter estimation. The adaptation is driven by a plant
modeling error as opposed to the usual tracking error in model-reference adaptive control. Simulations demonstrate
the superior performance of Chebyshev polynomials in an adaptive control setting over regular polynomials and even
neural networks. The least-squares gradient method is shown to outperform both the recursive least-squares adaptive
control and the standard, unmodified model-reference adaptive control in tracking performance. On the other hand,
recursive least-squares adaptive control is shown to be much more robust to time delay and unmodeled dynamics than
all the other adaptive control methods studied. However, this robustness comes at the expense of tracking performance.
Thus, in practice, the least-squares gradient method may strike a better balance between tracking performance and robustness.
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